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ABSTRACT 

Consider  a  MIMO  interference  channel  whereby  each  transmitter  and  receiver  are  equipped  with  multiple  antennas. 

The  basic  problem  is  to  design  optimal  linear  transceivers  (or  beamformers)  that  can  maximize  system  throughput.  The 
recent  work  \cite{Jafarl }  suggests  that  optimal  beamformers  should  maximize  the  total  degrees  of  freedom  and  achieve 
interference  alignment  in  high  SNR.  In  this  paper  we  first  consider  the  interference  alignment  problem  in  spatial 
domain  and  prove  that  the  problem  of  maximizing  the  total  degrees  of  freedom  for  a  given  MIMO  interference  channel 
is  NP-hard.  Furthermore,  we  show  that  even  checking  the  achievability  of  a  given  tuple  of  degrees  of  freedom  for  all 
receivers  is  NP-hard  when  each  receiver  is  equipped  with  at  least  three  antennas.  Interestingly,  the  same  problem 
becomes  polynomial  time  solvable  when  each  transmit/receive  node  is  equipped  with  no  more  than  two  antennas. 
Finally,  we  propose  a  distributed  algorithm  for  transmit  covariance  matrix  design,  while  assuming  each  receiver  uses  a 
linear  MMSE  beamformer.  The  simulation  results  show  that  the  proposed  algorithm  outperforms  the  existing 
interference  alignment  algorithms  in  terms  of  system  throughput. 
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Abstract 

Consider  a  MIMO  interference  channel  whereby  each  transmitter  and  receiver  are  equipped  with  multiple  antennas.  The  basic 
problem  is  to  design  optimal  linear  transceivers  (or  beamformers)  that  can  maximize  system  throughput.  The  recent  work  [1] 
suggests  that  optimal  beamformers  should  maximize  the  total  degrees  of  freedom  and  achieve  interference  alignment  in  high  SNR. 
In  this  paper  we  first  consider  the  interference  alignment  problem  in  spatial  domain  and  prove  that  the  problem  of  maximizing 
the  total  degrees  of  freedom  for  a  given  MIMO  interference  channel  is  NP-hard.  Furthermore,  we  show  that  even  checking  the 
achievability  of  a  given  tuple  of  degrees  of  freedom  for  all  receivers  is  NP-hard  when  each  receiver  is  equipped  with  at  least  three 
antennas.  Interestingly,  the  same  problem  becomes  polynomial  time  solvable  when  each  transmit/receive  node  is  equipped  with 
no  more  than  two  antennas.  Finally,  we  propose  a  distributed  algorithm  for  transmit  covariance  matrix  design,  while  assuming 
each  receiver  uses  a  linear  MMSE  beamformer.  The  simulation  results  show  that  the  proposed  algorithm  outperforms  the  existing 
interference  alignment  algorithms  in  terms  of  system  throughput. 


I.  Introduction 

Consider  a  multiuser  communication  system  in  which  a  number  of  users  must  share  common  resources  such  as  frequency, 
time,  or  space.  The  mathematical  model  for  this  communication  scenario  is  the  well-known  interference  channel ,  which  consists 
of  multiple  transmitters  simultaneously  sending  messages  to  their  intended  receivers  while  causing  interference  to  each  other. 
Interference  channel  is  a  generic  model  for  multiuser  communication  and  can  be  used  in  many  practical  applications  such  as 
Digital  Subscriber  Lines  (DSL)  [3],  Cognitive  Radio  (CR)  systems  [4]  and  ad-hoc  wireless  networks  [5],  [6], 

A  central  issue  in  the  study  of  interference  channel  is  how  to  mitigate  multiuser  interference.  In  practice,  there  are  several 
commonly  used  methods  for  dealing  with  interference.  First,  we  can  treat  the  interference  as  noise  and  just  focus  on  extracting 
the  desired  signals  (see  [15],  [21]).  This  approach  is  widely  used  in  practice  because  of  its  simplicity  and  ease  of  implementation, 
but  is  known  to  be  non-capacity  achieving  in  general.  An  alternative  technique  is  channel  orthogonalization  whereby  transmitted 
signals  are  chosen  to  be  nonoverlapping  either  in  time,  frequency  or  space,  leading  to  Time  Division  Multiple  Access,  Frequency 
Division  Multiple  Access  or  Space  Division  Multiple  Access  respectively.  While  channel  orthogonalization  effectively  eliminates 
multiuser  interference,  it  can  lead  to  inefficient  use  of  communication  resources  and  is  also  generally  non-capacity  achieving. 
Another  interference  management  technique  is  to  decode  and  remove  interference.  Specifically,  when  interference  is  strong 
relative  to  desired  signals,  a  user  can  decode  the  interference  first,  then  subtract  it  from  the  received  signal,  and  finally  decode 
its  own  message  (see  [8]  and  [11]).  This  method  is  less  common  in  practice  due  to  its  complexity  and  security  issues. 

In  a  cellular  system,  multi-cell  interference  management  is  a  major  challenge.  So  far  various  base  station  cooperation 
techniques  have  been  proposed  to  mitigate  inter-cell  interferences,  including  multi-point  coordinated  transmission,  or  network 
MIMO  transmission  [43^15].  Most  of  these  techniques  require  each  base  station  to  have  full/partial  channel  state  information 
(CSI)  as  well  as  the  knowledge  of  actual  independent  data  streams  to  all  remote  terminals.  With  the  complete  sharing  of  data 
streams  and  CSI,  the  multi-cell  scenario  is  effectively  reduced  to  a  single  cell  interference  management  problem  with  either 
total  [46]  or  per-group-of-antennas  power  constraints  [47],  [48].  While  these  techniques  can  offer  significant  improvement 
on  data  throughput,  they  also  have  several  drawbacks  including  stringent  requirement  on  base  station  coordination,  the  large 
demand  on  the  communication  bandwidth  of  backhaul  links,  and  the  heavy  computational  load  associated  with  the  increasing 
number  of  cells  [49],  [50]. 

Theoretically,  what  is  the  optimal  interference  management  strategy?  The  answer  is  related  to  the  characterization  of  capacity 
region  of  an  interference  channel,  i.e.,  determining  the  set  of  rate  tuples  that  can  be  achieved  by  the  users  simultaneously.  For 
the  noiseless  case,  the  capacity  region  and  the  optimal  precoding  strategy  of  the  two  user  interference  channel  is  discussed  in  [8] 
and  [7].  In  spite  of  intensive  research  on  this  subject  over  the  past  three  decades  (  [7]  -  [20]),  the  capacity  region  of  interference 
channels  is  still  unknown  for  general  case  (even  for  small  number  of  users).  The  lack  of  progress  to  characterize  the  capacity 
region  for  a  MIMO  interference  channel  has  motivated  researchers  to  derive  various  approximations  of  the  capacity  region.  For 
example,  the  maximum  total  degrees  of  freedom  (DoF)  corresponds  to  the  first  order  approximation  of  sum-rate  capacity  of  an 
interference  channel  at  high  SNR  regime.  Maximizing  this  approximation  of  sum-rate  leads  us  to  the  interference  alignment 
method  [1].  For  frequency  selective  channels,  interference  alignment  corresponds  to  correlated  signalling  across  different 
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frequency  tones.  This  linear  transceiver  scheme  for  interference  alignment  is  a  generalization  of  the  standard  OFDMA  scheme 
whereby  each  data  stream  is  transmitted  on  a  single  subcarrier,  which  corresponds  to  using  the  standard  unit  basis  vectors  { e, } 
(the  ?'-th  standard  unit  vector)  for  transmit  beamforming.  The  linear  transceiver  structure  for  interference  alignment  is  more 
general  since  it  does  not  require  diagonal  structure  nor  mutual  orthogonality  (two  transmit  covariance  matrices  X,  Y  are  said 
to  be  orthogonal  if  Tr(XF)  =  0). 

If  we  remove  mutual  orthogonality  condition  and  impose  only  diagonal  structure  on  transmit  covariance  matrices,  then  the 
interference  management  problem  is  reduced  to  the  dynamic  spectrum  management  problem  [40]  where  the  goal  is  to  find  the 
optimal  power  allocation  (i.e.,  optimal  diagonal  transmit  covariance  matrices)  which  can  maximize  system  throughput.  This 
problem  has  recently  been  a  topic  of  intensive  research  in  the  signal  processing  and  communications  communities.  For  diagonal 
matrix  channel  case  (e.g.  frequency  selective  scenario),  the  problem  of  maximizing  sum-rate  has  been  shown  to  be  NP-hard 
[40],  Several  algorithms  have  been  proposed  which  provide  varied  performance  in  different  channel  conditions.  These  include: 
Iterative  Waterfilling  Algorithm  IWFA  [22],  Successive  Convex  Approximation  Low  complExity  (SCALE)  algorithm  [39], 
Autonomous  Spectrum  Balancing  (ASB)  [29],  Optimal  Spectrum  Balancing  (OSB)  [24].  Furthermore,  different  algorithms  are 
proposed  for  the  case  when  the  channel  matrices  are  non-diagonal.  Authors  in  [23],  [25]  proposed  IWFA  based  algorithms  for 
power  allocation.  However,  these  selfish  approaches  work  well  only  in  low  SNR  cases  or  when  the  interference  is  low. 

Compared  to  the  networked  MIMO  approach,  interference  alignment  requires  less  information  exchange  among  transmitters, 
and  is  therefore  simpler  to  implement  in  practice.  Recently  two  iterative  algorithms  have  been  proposed  for  interference 
alignment  [2],  [41].  Both  appear  to  work  well  in  simulation  of  small  systems  (e.g.,  three  users,  each  equipped  with  two 
antennas).1  These  algorithms  require  system  users  to  first  specify  the  DoFs  for  all  receivers  and  then  attempt  to  achieve  them 
by  iteratively  aligning  the  interferences.  However,  these  algorithms  can  not  check  if  a  given  tuple  of  DoF  is  achievable,  nor  is 
there  any  guarantee  for  reaching  interference  alignment  even  when  the  given  tuple  of  DoF  is  achievable.  Moreover,  by  focusing 
only  on  high  SNR  regime  and  interference  alignment,  these  algorithms  do  not  attempt  any  power  allocations  across  different 
data  streams.  This  can  result  in  linear  transceivers  with  suboptimal  performance  at  low  to  intermediate  SNRs. 

In  this  paper,  we  consider  the  problem  of  maximizing  the  sum  of  DoFs  and  the  problem  of  checking  if  a  given  set  of 
DoFs  is  achievable  with  linear  transceivers.  We  study  the  complexity  status  of  both  of  these  problems  over  the  spatial  domain 
and  establish  their  NP-hardness.  These  results  suggest  that  the  two  existing  algorithms  for  interference  alignment  [2],  [41] 
cannot  converge  in  general.  We  also  propose  a  distributed  algorithm  to  design  linear  transceivers  for  interference  channels.  Our 
approach  is  based  on  using  MMSE  receivers  while  optimizing  transmit  covariance  matrices  for  all  transmitters.  We  maximize  the 
weighted  sum  of  a  utility  of  SINR’s  for  each  data  stream  and  use  iterative  convex  optimization/relaxation  to  compute  a  (local) 
optimal  solution.  The  utility  function  is  SINR/(1  +  SINR)  which  converges  to  1  when  SINR  — >  oo,  and  is  proportional 
to  SINR  when  the  SINR  value  is  small.  In  this  way,  maximizing  the  sum  of  utilities  for  all  data  streams  corresponds  to 
maximizing  the  total  DoF  when  the  noise  vanishes.  Simulations  show  that  our  algorithm  performs  well  in  all  SNR  regions  and 
can  deliver  far  superior  sum-rate  performance  than  the  existing  interference  alignment  algorithms  of  [2],  [41],  Compared  to 
the  networked  MIMO  approach  which  requires  sharing  of  data  streams,  our  linear  transceiver  design  algorithm  requires  less 
information  exchange:  at  each  iteration,  only  small  covariance  matrices  are  exchanged,  the  size  of  which  are  proportional  to 
the  antenna  numbers  at  each  transmitter  or  receiver. 


II.  System  Model 

Consider  a  Jv-user  MIMO  interference  channel  with  K  transmitter  -  receiver  pairs.  Let  H kj  be  an  AT  x  Mj  matrix 
representing  the  channel  gain  from  transmitter  j  to  receiver  fc,  where  Mj  and  Ay.  denote  the  number  of  antennas  at  transmitter  j 
and  receiver  k,  respectively.  The  received  signal  at  receiver  k  is  given  by 

K 

yk  =  kjXj  +  nfc, 

3= 1 

where  x7  is  an  Mj  x  1  random  vector  that  represents  the  transmitted  signal  of  user  j  and  ri/,:  ~  A/(0.  er2I)  is  a  zero  mean 
additive  white  Gaussian  noise. 

For  practical  considerations,  we  focus  on  optimal  linear  transmit  and  receive  strategies  that  can  maximize  system  throughput. 
In  particular,  suppose  transmitter  k  uses  a  beamforming  matrix  ~Vk  to  send  the  signal  vector  s k  to  its  intended  receiver  k.  At 
the  receiver  side,  receiver  k  estimates  the  transmitted  data  vector  sk  by  using  a  linear  beamforming  matrix  Ufc,  i.e., 

Xfc=Vfcsfc,  sfc  =  Ufcyfc, 

where  the  data  vector  sk  £  CdfcXl  is  normalized  so  that  i?[sfcS^]  =  I,  and  sk  is  the  estimate  of  sk  at  Ar-th  receiver.  VT  £  C MfcXdfc 
and  Ufe  £  CNkXdk  are  the  beamforming  matrices  at  the  transmitter  and  the  receiver  of  user  k,  respectively. 

^ven  though  the  two  algorithms  were  motivated  from  different  perspectives,  they  are  in  fact  algorithmically  identical. 


3 


It  is  known  that  the  problem  of  designing  optimal  beamformers  to  maximize  sum-rate  of  the  system  is  NP-hard  [40]  even 
in  the  single  transmit/receive  antenna  case.  Notice  that  recent  works  [1],  [2]  suggest  that  the  optimal  strategy  should  have 
interference  alignment  structure  in  the  high  SNR  regime.  Therefore,  we  are  led  to  find  a  linear  transmission-reception  strategy 
that  can  maximize  the  total  degrees  of  freedom.  In  the  next  section,  we  provide  the  complexity  analysis  of  this  problem. 


III.  NP-Hardness  of  Optimal  Interference  Alignment 

In  this  section,  we  show  that  for  a  given  channel,  not  only  the  problem  of  finding  the  maximum  DoF  is  NP-hard,  but  also 
the  problem  of  checking  the  achievability  of  a  given  tuple  of  DoF,  {d\, ...,  d k ) ,  is  NP-hard  when  there  are  at  least  3  antennas 
at  each  node. 

Notice  that  the  interference  alignment  conditions  in  the  fc-th  receiver  are 

UfeHfcjVj  =  0,  Wj^k,  (1) 

rank(u£HfcfcVfc)  =dfc.  (2) 

The  first  equation  guarantees  that  all  the  interference  is  in  the  subspace  orthogonal  to  U,t  while  the  second  one  assures  that 
the  signal  subspace  H/,:/i:V/,:  has  dimension  dk  and  is  linearly  independent  of  the  interference  subspace. 

In  the  sequel,  we  examine  the  solvability  of  above  interference  alignment  problem  (1)  -  (2)  in  two  different  cases. 


Theorem  1  For  a  K  user  MIMO  interference  channel,  maximizing  the  total  DoF,  namely, 

K 

max  y  dk 
(ufc.vfc}^=1  ^ 

s.t.  Ufc HfcjVj  =  0,  k  =  l,..,K,  j^k 
rank  (U^HfcfcVfc)  =  dk,  k=l,..,K 

is  NP-hard.  Moreover,  if  each  node  is  equipped  with  at  least  3  antennas,  then  the  problem  of  checking  the  achievability  of  a 
given  tuple  of  DoF,  (di,  g^,  . . . ,  dx),  is  also  NP-hard. 


Proof:  The  proof  of  the  hist  part  is  based  on  a  polynomial  time  reduction  from  the  maximum  independent  set  problem 
which  is  known  to  be  NP-complete.  For  a  given  arbitrary  graph  G  =  ( V ,  E),  where  V  =  K,  consider  a  K  user  interference 
channel  that  each  receiver  and  transmitter  has  a  single  antenna.  Moreover,  the  channel  coefficients  are  given  by: 


f  1,  if  j  =  k  or  (k,j)  G  E, 

[  0,  otherwise. 


It  can  be  checked  that  the  receiver  nodes  can  only  achieve  a  DoF  of  either  0  or  1,  and  those  receiver  nodes  achieving  a  DoF 
of  1  form  an  independent  set  in  G.  Thus,  the  problem  of  maximizing  the  total  DoF  for  the  above  interference  channel  is 
equivalent  to  the  problem  of  finding  the  maximum  independent  set  of  vertices  in  the  graph  G. 

In  order  to  prove  the  second  part  we  use  a  polynomial  reduction  from  the  3-colorability  problem.  The  latter  problem  is  to 
determine  whether  the  nodes  of  a  graph  can  be  assigned  one  of  the  three  possible  colors  so  that  no  two  adjacent  nodes  are 
colored  the  same.  The  3-colorability  problem  is  known  to  be  NP-Complete.  There  are  two  main  steps  in  the  construction.  In  the 
first  step,  some  dummy  nodes  are  added  to  the  channel  in  order  to  force  a  discrete  structure  such  that  each  non-dummy  node 
may  only  have  one  of  the  three  possible  cases.  The  second  step  is  to  define  the  direct  channels  in  order  to  make  a  polynomial 
reduction  from  the  3-colorability  of  an  arbitrary  graph  to  this  problem. 

For  an  arbitrary  graph  G  with  N  nodes,  we  will  construct  a  special  MIMO  interference  channel  for  which  the  achievability 
of  one  degree  of  freedom  at  each  user  is  equivalent  to  the  3-colarability  of  G.  In  our  construction,  the  MIMO  interference 
channel  will  have  two  types  of  users:  N  main  users,  each  equipped  with  3  antennas  at  their  transmitters  and  receivers  and 
117V  dummy  users  which  will  be  defined  later.  Hence  the  total  number  of  users  is  127V.  In  the  rest  of  the  proof  we  suppose 
that  each  user  (either  the  dummy  user  or  the  main  user)  wants  to  send  one  data  stream.  In  other  words  we  want  to  check  if 
the  tuple  of  all  ones  is  achievable  by  the  constructed  interference  channel  or  not. 

We  divide  the  dummy  users  into  two  groups.  The  number  of  dummy  users  in  the  first  group  is  27V  and  the  number  of  dummy 
users  in  the  second  one  is  97V.  Each  dummy  user  in  the  first  group  has  3  antennas  at  its  receiver  and  transmitter,  while  each 
dummy  user  in  the  second  group  has  two  antennas  at  its  transmitter  and  receiver.  Let  us  further  arrange  the  27V  dummy  users 
in  the  first  group  into  TV  subsets  each  containing  two  users.  We  denote  these  subsets  as  Aj,  i  =  1, TV,  A,;  =  2.  We  also 
denote  the  users  in  the  set  A,;  as  a,ti  and  aiy,  and  associate  them  to  the  7-th  main  user.  For  notational  consistency,  we  denote 
main  user  i  as  al:o-  We  will  also  use  a^kj  to  denote  the  j-th  transmit  antenna  of  user  a^fc,  where  1  <  *  <  TV,  k  =  0, 1,  2 
and  j  =  1, 2, 3.  Similarly,  we  partition  the  set  of  97V  dummy  users  in  the  second  group  into  TV  subsets  II, ,  7  =  1, ...,  TV,  each 
containing  exactly  9  dummy  users  denoted  by  6,^,  with  i  =  1,..,9.  Each  of  these  9  dummy  users  will  have  two  receiving 
antennas  which  we  denote  as  bi^m,  with  m  =  1,2. 
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Now  for  any  fixed  i  and  j,  we  consider  any  size-2  subset  of  {a»  j  :  k  =  0,1,2},  e.g.,  «i,i  j}.  For  each  fixed  i  and 

j,  there  are  exactly  3  of  these  cardinality-2  subsets.  Since  there  are  3  different  choices  of  j,  we  have  a  total  of  9  subsets  of  this 
kind  for  any  fixed  i.  Let  us  index  these  9  subsets  by  £,  £  =  1, 9,  and  assign  the  £-\h  subset  to  user  bi/  in  Bi.  Now  we  define 
the  links  in  the  channel  for  the  users  in  Ai  and  B, .  First,  the  channel  matrices  of  all  the  direct  links  for  any  of  the  dummy 
users  are  I  (where  I  is  the  identity  matrix  of  the  appropriate  size).  In  addition,  none  of  the  dummy  users  in  Bi  {i  =  1, 2, ...,  N) 
cause  interference  to  the  other  users  (which  means  that  the  channel  gains  between  their  transmit  antennas  and  the  other  users’ 
receive  antennas  are  all  zero).  Now  for  the  aforementioned  f?-th  subset  which  we  denote  as  Si/  =  {a-i/^  ,je  ,  o,i,ke2  ,jt,2 },  we 
connect  ^  and  a.i,ke2,je2  to  b,jt  and  to  bi/^,  respectively.  Here  by  connecting  a  transmit  antenna  to  a  receive  antenna 
we  mean  that  the  channel  coefficient  between  these  two  antennas  is  1.  This  situation  is  shown  in  the  figure  1  for  the  case 
Si  a  =  {a*  o,i,  Furthermore,  we  assume  that  dummy  users  a.t/,  k  =  1,2  do  not  suffer  from  any  interference. 


1:  Channels  to  the  dummy  receiver  bi/ 

Suppose  that  user  a,/  ( k  =  0, 1, 2)  uses  the  transmit  beamforming  vector  (vitk,i,Vi,k,2,  Vi/,3)-  Then  the  interference  received 
at  the  dummy  receiver  of  bi  g  will  be: 

=  (vi,ke1,je1si,k(1tvi,ke1,je2si,ke  2)  (3) 

where  Si/  is  the  signal  user  ar/  intends  to  send.  Notice  that  the  signals  which  two  different  users  want  to  transmit  are 
statistically  independent.  As  a  consequence,  if  we  want  to  have  interference  alignment  at  the  receiver  of  bi/,  so  that  this  user 
can  send  its  own  data  stream,  it  is  necessary  and  sufficient  to  have  Vifa  jt  Vi,kt2,ji2  =  0-  Hence,  having  the  interference 
alignment  at  bi/  for  all  1=1, ..,  9  is  equivalent  to  the  fact  that  users  at/,  k  =  0, 1, 2  cannot  send  their  messages  through  the 
antennas  with  the  same  index,  simultaneously.  For  example,  if  1^0,1  /  0  then  v,/\  .1  and  1^2.1  have  to  be  zero.  On  the  other 
hand,  considering  the  fact  that  each  user  needs  to  send  one  data  stream,  it  follows  that  none  of  the  users  at/,  k  =  0,1, 2,  can 
send  their  message  on  two  of  their  antennas  simultaneously,  because  otherwise  if  for  example  atjj  sends  its  message  on  two 
antennas,  then  it  would  result  in  insufficient  spatial  dimension  for  either  <2^1  or  a,  2. 

As  an  immediate  consequence  of  these  two  facts  we  have  just  mentioned,  we  can  conclude  that  the  transmit  beamforming 
vector  at  each  user  ai/,k  =  0,1,2,  must  be  proportional  to  one  of  the  vectors  [1,0, 0]T,  [0,1,  0]7  or  [0,0,1]2.  This  is  true 
specially  for  the  main  user  i.  As  we  are  not  concerned  about  the  constant  factors,  we  have  successfully  imposed  a  discrete 
structure  on  the  problem  solution  so  far.  Notice  that  each  dummy  user  bi/  has  a  total  of  2  dimensions  in  its  receiver.  Since 
we  have  aligned  the  interference  at  each  dummy  user  bi/,  these  users  can  communicate  their  data  streams  easily  along  the 
remaining  dimension  left  for  them  in  their  receivers  and  remove  interference  which  lies  in  the  other  dimension.  Moreover, 
since  in  our  constmction  the  dummy  users  a*  k  =  1,2  do  not  experience  any  interference  from  other  users  and  their  direct 
channel  is  I,  so  these  users  can  easily  achieve  one  degree  of  freedom.  Thus,  we  only  need  to  take  care  of  the  main  users. 

For  each  of  the  N  main  users,  we  must  pick  one  of  the  three  transmit  beamforming  vectors  [1, 0, 0] 1 ,  [0, 1, 0]2  or  [0,  0,  l]1 
in  order  to  achieve  interference  alignment  at  all  the  main  receivers.  We  suppose  all  the  direct  channels  for  the  main  users,  Hu, 
are  I.  For  the  cross  channels,  we  use  the  stmcture  of  graph  G  =  (V,E).  For  each  edge  (i,j)  in  G,  we  set  H,:)  =  H:ll  =  I. 
Otherwise  we  set  Hy  =  H;/I  =  0  (zero  matrix  of  appropriate  size).  Consequently,  the  main  users  i  and  j  interfere  with 
each  other  if  and  only  if  they  are  connected  to  each  other  in  graph  G.  We  claim  that  achieving  interference  alignment  in  the 
above  MIMO  interference  channel  is  equivalent  to  3-colorability  of  graph  G.  This  is  because  each  user  can  choose  3  possible 
beamforming  vectors,  each  corresponding  to  a  different  color.  If  main  user  i  chooses  one  of  the  three  possible  beamforming 
vectors  (or  one  of  the  three  colors),  then  this  beamforming  vector  cannot  be  chosen  by  any  other  main  users  adjacent  to  the 
main  user  i  in  the  graph  G,  otherwise  the  interference  would  appear  in  the  desired  signal  space  at  the  receiver  of  main  user 
i.  This  establishes  the  equivalence  between  the  3-colorability  of  G  and  the  achievability  of  one  degree  of  freedom  for  each 
user  in  the  constructed  MIMO  interference  channel.  Since  3-colorability  problem  is  NP-hard,  it  follows  that  the  problem  of 
checking  the  feasibility  of  interference  alignment  is  also  NP-hard.  ■ 
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Theorem  1  shows  that  the  problem  of  checking  the  achievability  of  a  given  tuple  of  DoF  is  NP-hard  if  all  users  (or  at 
least  a  constant  fraction  of  them)  are  equipped  with  at  least  three  antennas.  Our  next  result  shows  that  when  each  user  is 
equipped  with  no  more  than  two  antennas,  the  same  problem  can  be  solved  in  polynomial  time.  To  this  end,  we  need  to  define 
some  notations  and  make  some  observations.  First  of  all,  the  interference  alignment  problem  is  equivalent  to  finding  the  signal 
subspaces  at  the  transmitters  and  the  interference  subspaces  at  the  receivers  such  that  the  interference  alignment  conditions  are 
satisfied,  i.e., 

dk  =  dim(Sfc) 

I  Ik 

H kjSj  C  Ik  Vj  ^  k, 

where  Sk  and  Ik  denote  the  signal  subspace  at  the  transmitter  k  and  the  interference  subspace  at  receiver  k,  respectively. 
The  operator  _L  represents  the  linear  independence  of  two  subspaces.  The  first  condition  implies  that  the  signal  space  has 
dimension  dk  while  the  second  condition  says  that  the  interference  subspace  and  the  received  signal  subspace  must  be  linearly 
independent.  Finally,  the  third  condition  assures  that  the  interference  from  other  users  lies  in  the  interference  subspace  (which 
is  linearly  independent  of  the  signal  subspace). 

Notice  that  in  the  2-antenna  case,  if  dj  =  dk  =  1  and  rank(Hjy)  =  2,  and  the  interference  subspace  Ik  is  known,  then  Sj 
can  be  uniquely  determined  by  S,  =  H kjlk,  for  any  j  f  k.  Conversely,  if  Sj  is  known,  we  can  uniquely  find  the  interference 
subspace  of  user  k,  i.e.,  Ik  =  Hk.jSj.  Thus,  by  starting  from  a  node  with  a  known  subspace  and  traversing  the  interference  links 
with  full  rank  channel  matrices,  we  can  uniquely  determine  the  signal  subspaces  in  the  transmitter  sides  and  the  interference 
subspaces  at  the  receiver  sides  as  long  as  they  all  have  one  DoF.  Furthermore,  if  we  find  a  loop  of  full  rank  interfering  links, 
the  signal  subspaces  at  these  nodes  must  be  the  eigenvector  of  the  composite  channel  matrix  of  the  corresponding  loop.  To 
make  this  point  clear,  consider  a  4-user  interference  channel.  If  all  interfering  links  are  full  rank,  by  starting  from  transmitter 
1  and  use  the  loop  Txl  — ►  Rx2  — >  Tx3  — >  Rx4  — >  Txl,  we  have  the  following  relations 

J2  =  H2i«Si,  S3  =  H23lI2,  I4  =  H431S3,  =  H4111’4. 

Thus,  5)  must  be  the  eigenvector  of  the  loop  channel  matrix  IT ,[3, FIT-.1  H21 .  Using  this  observation  and  the  idea  of  traversing 
the  full  rank  interfering  channel  links,  we  can  establish  the  polynomial  solvability  of  the  problem  of  checking  the  achievability 
of  a  given  tuple  of  DoF. 

Theorem  2  For  a  K-user  MIMO  interference  channel  where  each  transmit/receive  node  is  equipped  with  at  most  two  antennas, 
the  problem  of  checking  the  achievability  of  a  given  tuple  of  DoF  is  polynomial  time  solvable. 

Proof:  By  assigning  zero  channel  weight  if  necessary,  we  can  assume  without  loss  of  generality  that  all  transmit¬ 
ters/receivers  are  equipped  with  exactly  two  antennas,  i.e.,  Mk  =  Nk  =  2,  for  all  k  =  1, 2,  •  •  •  ,K.  Furthermore,  notice  that  if 
a  user  has  zero  DoF  (dk  =  0),  then  we  can  assign  the  zero  beamforming  vector  to  this  user  and  remove  it  (both  its  transmitter 
and  receiver)  from  the  system.  Thus,  we  can  assume  1  <  dk  <  2  for  all  k  =  1,2,---  ,  K.  We  further  assume  that  all  the 
direct  channel  matrices  H kk,  k  =  1,2, ... .  K,  are  nonzero.  Now  the  problem  is  to  determine  whether  the  given  tuple  of  DoF 
(d\ ,  <J2,  ■  ■  •  ,  d  k  )  is  achievable  or  not.  To  this  end,  we  need  to  define  two  bipartite  graphs  over  the  nodes  of  the  interference 
channel  (one  side  of  the  graph  consists  of  transmit  nodes  and  the  other  consists  of  the  receive  nodes).  In  particular,  we  construct 
a  bipartite  graph  G  by  connecting  the  transmit  node  of  user  i  to  the  receive  node  of  user  j  if  and  only  if  the  channel  between 
them  is  nonzero,  i.e.,  Hjj  /  0.  Furthermore,  we  construct  a  bipartite  subgraph  G'  =  ( V .  II)  of  G  by  considering  only  the 
full  rank  links  of  G,  i.e.,  connecting  transmit  node  i  to  the  receive  node  j/j  if  and  only  if  rank(Hji)  =  2.  Notice  that  the 
link  between  transmit  node  i  and  receive  node  i  is  not  included  in  G'  even  if  rank  (FI.,,)  =  2. 

In  what  follows,  we  first  consider  a  simple  case  which  gives  us  the  idea  of  how  a  loop  of  rank  2  interfering  channels  forces 
a  discrete  structure  on  the  choice  of  signaling  subspaces  at  the  transmitters.  Then,  using  this  idea,  we  provide  the  proof  for 
the  general  case. 

Consider  a  connected  component  H  of  G  where  all  the  interfering  links  are  full  rank  and  connected,  i.e.,  the  induced 
subgraph  of  H  over  G'  is  connected  and  contains  all  the  interfering  links  of  H.  We  first  argue  that  H  can  not  contain  the 
receive  node  of  any  user  k  with  dk  =  2.  Suppose  the  contrary.  Then  the  direct  channel  matrix,  Fl/i:/r,  must  be  full  rank.  [If  FU:/;: 
is  rank  deficient,  then  the  received  signal  subspace  at  receiver  k  has  dimension  at  most  1,  which  would  make  it  impossible 
to  achieve  dk  =  2.]  We  further  claim  that  H  cannot  contain  any  other  nodes.  Since  the  direct  link  between  the  transmit  and 
receive  nodes  of  user  k  is  not  contained  in  //,  it  follows  that  the  receive  node  of  user  k  must  be  connected  to  another  transmit 
node  a  in  H.  Let  this  node  a  be  associated  with  a  user  j  (j  f  k).  Notice  that  user  j  achieves  a  DoF  at  least  1  (since  all  zero 
DoF  users  have  been  removed  from  G).  By  definition,  node  a  must  be  connected  to  the  receive  node  of  user  k  via  a  full  rank 
cross  talk  channel  matrix  FI .  Thus,  user  j  will  cause  a  nonzero  interference  subspace  to  user  /.;,  contradicting  dk  =  2.  Since 
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all  users  with  DoF  =0  has  been  removed  from  graph  G,  we  must  have  dk  =  1  for  all  receive  nodes  in  //.  For  the  other  case 
where  node  a  is  a  receive  node  of  user  j,  then  a  is  linked  to  the  transmit  node  of  user  k  via  a  full  rank  channel  matrix.  In 
this  case,  user  k  will  cause  a  2-dimensional  interference  subspace  to  user  j,  making  it  impossible  to  have  dj  >  I- 

We  now  assume  that  all  receive  nodes  in  H  have  one  DoF.  We  can  start  from  an  arbitrary  initial  node  of  H  and  use  Breadth 
First  Search  (BFS)  to  find  a  spanning  tree.  Since  each  user  has  one  DoF,  the  signal  and  interference  spaces  of  all  receive  nodes 
in  H  are  uniquely  determined  by  the  signal  (or  interference)  space  of  the  initial  node.  Since  the  initial  node  is  arbitrary,  this 
shows  that  the  signal/interference  spaces  for  all  nodes  in  H  are  linearly  related  to  each  other  (via  some  constant  composite 
channel  matrices,  see  the  discussion  before  Theorem  2).  Fixing  any  one  uniquely  determines  the  rest.  For  the  remaining  edges 
(or  links)  not  in  the  spanning  tree,  they  each  create  a  unique  loop  in  the  tree.  We  can  compute  the  composite  channel  matrices 
for  these  loops  (see  the  discussion  before  Theorem  2).  Notice  that  each  loop  matrix  (size  2x2)  has  either  one,  two  or  infinitely 
many  eigenvectors  (when  the  composite  channel  matrix  is  a  constant  multiple  of  identity  matrix).  Suppose  a  loop  matrix 
(starting  from  a  given  transmit  node,  say  6,  in  the  loop)  has  one  or  two  unique  eigenvectors,  then  the  signal  space  of  node  b 
must  be  generated  by  one  of  these  eigenvectors.  In  fact,  since  the  beamforming  vectors  of  nodes  in  H  are  linearly  related,  each 
loop  in  H  places  a  restriction  on  the  choice  of  beamforming  vector  of  node  b.  Thus,  for  any  fixed  transmit  node  b  in  H,  there 
are  multiple  restriction  sets,  each  corresponding  to  a  loop  in  H  caused  by  adding  an  edge  to  the  minimum  spanning  tree  and 
each  containing  one/two  one-dimensional  subspaces  from  which  node  6’s  signal  space  can  be  chosen.  The  receive  nodes  in  H 
can  achieve  interference  alignment  if  and  only  if  these  restricted  sets  of  one-dimensional  signal  subspaces  for  node  b  share  a 
common  one-dimensional  subspace.  Moreover,  to  ensure  each  user  in  H  achieves  one  DoF,  we  need  to  additionally  make  sure 
that  the  resulting  interference  subspaces  at  all  receive  nodes  in  H  are  linearly  independent  from  the  corresponding  respective 
signal  subspaces.  Since  the  total  number  of  restriction  sets  is  at  most  linear  in  the  number  of  edges  in  H  and  each  restriction 
set  contains  at  most  two  one-dimensional  subspaces,  checking  if  these  restrictions  have  any  common  one-dimensional  subspace 
can  be  carried  out  in  0(I\2)  time.  Moreover,  for  each  common  one-dimensional  subspace,  checking  if  the  linear  independence 
between  the  resulting  signal  subspace  and  interference  subspace  (already  aligned)  at  each  receive  node  can  also  be  performed 
in  time  that  is  linear  in  the  number  of  nodes  in  H,  or  in  O(K)  time. 

Now  we  are  ready  to  look  into  the  general  case  in  which  the  rank  1  links  are  considered  as  well  as  the  full  rank  links. 
Since  there  is  no  interfering  links  between  different  connected  components  of  G,  we  can  assign  the  signal  subspace  for  each 
connected  component  separately.  Notice  that  the  number  of  connected  components  of  G  is  at  most  K,  we  only  need  to  assign 
transmit  subspaces  for  every  connected  component  of  G  in  polynomial  time. 

Let  H  be  a  connected  component  of  G.  Let  II'  CG'  be  a  subgraph  of  H  which  contains  only  links  with  full  rank  channel 
matrices.  H'  can  be  broken  into  various  connected  components  of  C .  By  the  argument  above  for  such  components,  the 
signal/interference  spaces  for  the  nodes  in  these  connected  components  (consisting  of  at  least  two  nodes)  can  be  assigned  in 
one  of  the  two  ways: 

(Bl)  The  connected  component  contains  a  cycle  with  a  channel  matrix  that  is  not  equal  to  a  constant  multiple  of  the  identity 
matrix.  In  the  case,  the  beamforming  vectors  of  all  nodes  can  be  determined  from  the  eigenvector(s)  of  a  certain  loop 
channel  matrix.  In  this  case,  there  are  at  most  two  possible  choices  of  signal/interference  space  for  each  node. 

(B2)  The  connected  component  has  no  loops  (i.e.,  forms  a  tree)  or  if  every  loop  has  a  composite  channel  matrix  that  is  a 
constant  multiple  of  the  identity  matrix.  In  this  case,  the  signal/interference  spaces  of  all  nodes  are  linearly  related  to  one 
another.  The  signal/interference  space  of  one  node  can  be  fixed  at  an  arbitrary  one-dimensional  subspace.  Once  this  is 
fixed,  the  signal/interference  spaces  of  other  nodes  can  be  derived  uniquely. 

Consider  a  rank-1  interfering  link  in  H  with  channel  matrix  H,_,  (i  ^  j).  If  user  j  transmits  in  the  null  of  Hl;/,  then 
the  signalling  subspace  of  user  j  is  known,  i.e.,  S3  =  Null(Hij).  Otherwise,  the  interference  subspace  at  user  i  is  known, 
i.e.,  X,  =  Range(Hij).  This  is  because  di  >  1,  so  we  have  diml^  <  1.  This  plus  the  fact  that  Range(Hij)  C  X%  implies 
li  =  Range(Hij).  Therefore,  we  can  assign  a  Boolean  variable  xi3  to  each  rank-1  channel  H,,,  with  “xi3  =  1”  representing 
Sj  =  Null(Hij)  and  “r,,  =  0”  signifying  Xi  =  Range(Hij).  In  this  way,  we  associate  a  Boolean  variable  x.l3  for  each  rank-1 
crosstalk  channel  matrix  HtJ  in  H. 

Next  we  represent  the  interference  alignment  condition  at  each  receive  node  of  H  using  the  Boolean  variables  { x7J }  (plus 
some  auxiliary  Boolean  variables  {y3,  Zij,  Zij  defined  below).  Suppose  user  i’s  receive  node  is  in  H.  We  consider  the  cases 
di  =  2  and  d3  =  1  separately. 

Case  di  =  2:  In  this  case  X,  =  0,  so  we  must  have  x,3  =  1.  We  rewrite  this  condition  in  the  form  of  two  2-SAT  clauses 

%ij  V  yi,  Xij  V  Hi,  for  all  j  ^  i  and  rank(H^)  =  1,  (4) 

where  yi  is  an  auxiliary  Boolean  variable.  In  this  case,  the  satisfaction  of  (4)  and  the  condition  that  the  receive  node  of  user 
i  is  not  connected  to  other  users’  transmit  nodes  via  rank-2  links  is  equivalent  to  achieving  one  DoF  for  user  i. 

Case  di  =  1  and  rank(H^)  =  1:  In  this  case,  then  the  received  signal  subspace  is  =  Range(Hu)  and  dimX^  =  1,  so 

that  all  the  interference  at  the  receive  node  of  user  i  must  be  aligned  in  an  one-dimensional  subspace  that  is  linearly  independent 
of  Range(Hii).  We  need  to  further  consider  several  subcases,  depending  on  if  the  receive  node  of  user  i  is  connected  to  other 
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transmit  nodes  via  rank-1  or  rank-2  links.  In  particular,  if  the  transmit  nodes  of  users  j  and  k  are  connected  to  receive  node  i 
via  rank-1  links,  then  the  interference  alignment  condition  requires  the  satisfaction  of  the  following  2-SAT  clauses 


Xij  V  Xik , 
Xij  V  Z'i'j ,  X,:i  V  Zjj  , 


for  all  j  7^  k  i  such  that  ra,nk(H,(7)  =  rank(H,fc)  =  1  and  Range(Hij)  7^  Range(H.ik) , 
for  all  j  i  such  that  rank(Hjj)  =  1  and  Range(H,j)  =  Range(Hu), 


(5) 


where  Zij  is  a  dummy  Boolean  variable,  and  the  last  condition  corresponds  to  the  linear  independence  requirement  of  the 
signal/interference  subspaces.  Moreover,  if  there  is  a  rank- 2  link  connecting  the  receive  node  of  user  i  to  the  transmit  node 
of  user  £,  £  i,  i.e.,  H,p  is  full  rank,  then  the  receive  node  of  user  i  is  in  //'.  Consequently,  the  transmit  strategy  of  user 
l  has  only  two  possibilities  B1  and  B2  as  outlined  above.  For  the  Case  B1  where  the  transmit  node  of  user  £  can  pick  one 
of  the  two  possible  beamforming  vectors  \lj,  v] ,  we  define  a  Boolean  variable  zp  with  “zi  =  0”  representing  v°  is  chosen, 
while  “z(  =  1”  signifying  v]  is  chosen.  Now  the  interference  alignment  for  user  i  requires  the  satisfaction  of  following  2-SAT 
clauses 

zp  V  x^,  for  all  j  7^  £  7^  i  such  that  rank(Hy)  =  1,  rank(H^)  =  2  and  H ^  Range(Hij) , 

zp  V  x^,  for  ai\  j  i  such  that  rank(Hjj)  =  1,  rank(H^)  =  2  and  H,jy}  ^  Range(Hij) . 


(6) 


If  in  Case  B1  the  transmit  node  of  user  £  must  pick  a  unique  vector  v°,  then  we  must  have  Zp  =  0  and  xi:)  =  1  if  0 

Range(H-ij),  and  zp  =  0  if  G  Range(Hjj).  The  latter  conditions  are  equivalent  to  the  satisfaction  of  the  following  2-SAT 

clauses: 

zp  V  x^,  zp  V  Xij,  zp  V  x^,  for  all  j  ^  £  7^  i  s.t.  rank(Hij)  =  1,  rank(H^)  =  2  and  ^  RangeiYLij), 

zp  V  x^,  zp  V  x-ij,  for  all  j  ^  ^  i  s.t.  rank(H.y  )  =  1,  rank(H^)  =  2  and  G  RangeiYLij). 

To  ensure  linear  independence  of  the  signal  and  interference  subspaces  for  user  i,  we  must  make  sure  the  satisfaction  of  the 
following  2-SAT  clauses 

zp  V  yi,  zp\!  yi,  for  all  t  /  i  s.t.  rank(Hif)  =  2  and  H  uvj  G  Range(Hu), 

zp  V  yi,  zp  V  y.j ,  for  all  £  /  i  s.t.  rank(H.^)  =  2  and  H^v°  G  Range(H.u), 


(7) 


(8) 


where  y,  is  a  dummy  Boolean  variable.  Now  we  consider  Case  B2.  Suppose  the  receive  node  of  user  i  lies  in  a  connected 
component  H"  of  H' .  Then,  for  each  pair  of  receive  node  of  users  i  and  £  in  II"  (i,  7^  £),  there  exists  a  (efficiently  computable) 
nonsingular  matrix  Gu  such  that 

=  Gulp. 


To  ensure  this  condition,  the  following  2-SAT  clauses  must  be  satisfied  for  all  transmit  nodes  j  and  k  in  II": 

Xij  V  xpk,  for  all  j  i,  k  £  s.t.  rank(H^fc)  =  rank(Hy )  =  1,  and  GipRange( Upk)  7^  Range(Hij) . 


(9) 


Furthermore,  to  make  sure  that  the  signal  and  interference  subspaces  are  linearly  independent  at  the  receive  node  of  user  i, 
we  must  have  for  all  transmit  node  j  in  II"  that  the  following  2-SAT  clauses  are  satisfied 


Xpj  V  Zij,  Xij  V  Zij, 


for  all  j  /  j  s.t.  Range(Hij)  =  Range(Hi 


(10) 


Finally,  we  notice  that  the  Boolean  variables  {xpp,  ztp }  all  represent  the  signaling  strategies  of  user  £.  We  must  ensure  that  these 
signaling  strategies  are  compatible.  In  other  words,  we  can  not  simultaneously  have  both  Sp  =  Null(H,()  and  Sp  =  NuIHVl.jp) 
(j  7^  i),  unless  of  course  the  two  null  spaces  are  equal.  This  implies  that  we  should  have 


xu  V  Xjp,  for  all  i  7^  j  7^  £  s.t.  rank(Hjf)  =  rank(Ef^)  =  1,  Null{H.ip)  7^  Null(iijp). 


(11) 


Moreover,  if  the  transmit  node  of  user  £  is  also  in  H"  and  its  transmit  beamforming  vector  must  be  chosen  from  the  set 
{Vj,  v], }  (Case  Bl).  Then,  by  a  similar  argument,  we  must  also  ensure  the  following  compatibility  conditions: 


Xpp  V  zp,  for  all*  7^  j  7^  £  s.t.  rank(H^)  =  1,  rank(H^)  =  2,  v°  ^  NulKiLip), 
xu  V  zp,  for  all  i  7^  j  7^  £  s.t.  rank(H^)  =  1,  rank(H_^)  =  2,  ^  Null(Hip). 


(12) 


In  case  of  B2  (i.e.,  H"  is  a  tree  or  all  loop  matrices  are  constant  multiples  of  identity  matrix),  then  the  transmit  subspace  of 
user  £  (which  lies  in  H ")  can  be  chosen  continuously  (rather  than  from  a  discrete  set  {v°,  v] }).  In  this  case,  the  compatibility 
condition  (11)  is  sufficient;  there  is  no  additional  compatibility  condition  needed. 

Case  di  =  1  and  rank('Hri)  =  2:  In  this  case,  if  the  transmit  node  of  user  i  is  connected  to  a  receive  node  of  user  j  via 
a  rank-1  link,  then  Xji  =  1  signifies  the  use  of  transmit  beamforming  subspace  of  Null(Hji )  for  user  i\  else  if  transmitter  i 
is  in  H"  so  that  its  transmit  beamforming  direction  must  be  chosen  from  v°,  v,1 ,  corresponding  to  z,  =  0  and  1  respectively 
(Case  Bl).  [Case  B2  corresponds  to  the  continuous  selection  of  beamforming  vector  for  user  7;  no  2-SAT  clause  is  needed  in 
that  case.]  In  the  first  case,  the  signal  subspace  at  receive  node  of  user  i  becomes  HiiNull(Hji),  while  in  the  second  case. 
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the  signal  subspace  is  V°,  or  HjjvJ.  We  must  make  sure  the  signal  subspace  is  linearly  independent  from  the  interference 
subspace  of  user  i.  This  implies  that  the  following  2-SAT  clauses  must  be  satisfied: 

Xji  V  Xu,  for  all  i  ^  j  ^  l  s.t.  rank(H^)  =  rank(Hji)  =  1,  Range(Hu)  =  H uNull(Hji) , 
xu  V  Zi,  for  all  i  ^  j  s.t.  rank(H^)  =  l,i6  H",  HjjV?  £  Range{ H«),  (13) 

xu  V  Zi ,  for  all  i  ^  j  ^  i  s.t.  rank(H^)  =  1,  i  £  H" ,  £  Range(H.u). 

It  can  be  checked  that  the  DoF  tuple  (di,  d^, ■ ■  ■  ,  d k )  is  achievable  if  and  only  if  conditions  (4)-(  1 3)  are  satisfied  for  some 
binary  realizations  of  Boolean  variables  {xij,  yi,  Zi,  Zij}.  Moreover,  the  number  of  such  2-SAT  clauses  is  polynomial  in  K 
(in  fact  ()(K1)).  Hence,  we  have  transformed  the  DoF  feasibility  problem  in  polynomial  time  to  an  instance  of  2-satisfiability 
problem.  The  latter  problem  is  known  to  be  solvable  in  polynomial  time.  ■ 


IV.  Strategies  for  Linear  Transceiver  Design 


In  this  section,  we  propose  linear  transceiver  design  algorithms  for  interference  channels.  Using  linear  transceivers  introduced 
in  Section  II,  the  estimated  data  stream  at  receiver  k  is  given  by 


K 


HfcJ  visi  +  Ufc  nfc 

3=1 

and  the  SINR  value  for  the  g-th  data  stream  of  user  k,  7J?,  is  given  by 


7 1  = 


KKkkvl\ 


Q  12 


^Kll2- 


E 


where  u|  and  v|  denote  the  <y-th  column  of  U/,  and  V /. ,  respectively.  Using  a  linear  MMSE  receiver  uf. ,  we  have 

7 1  =  vf  HL(a2I  +  Y, 


One  possible  choice  of  the  utility  function  for  the  Ar-th  user  could  be  the  sum  of  the  SINR  values  of  its  data  streams,  i.e., 

Ck=Y  7fc 

Q 

=  5>fH^(a2I+  £ 

However,  maximizing  ^  does  not  lead  to  the  maximization  of  the  total  DoF  in  high  SNR.  Therefore,  we  need  to  introduce 
another  utility  function  in  order  to  capture  more  DoF  for  each  user.  First,  we  define  U  (7)  =  as  the  utility  function  of  the 
g-th  data  stream  of  user  k  and  then,  we  consider  Uk  =  E<j^(7fc)  as  the  utility  function  of  user  k.  Thus,  at  high  SNR,  Uk 
equals  the  DoF  at  receiver  fc,  while  at  low  SNR,  Uk  equals  the  sum  SINR.  Using  the  rank  one  update  of  the  matrix  inverse 
term  in  SINR  value,  we  can  rewrite  Uk  as 

Z4  =  ]Tvf  H£fc(a2I+  £  HyvJvjX-J-'Huv’. 

9  0» 


The  proposed  utility  function  preserves  fairness  among  different  data  streams  of  user  k  and  also  closely  approximates  the  sum 
DoF  at  high  SNR. 

Directly  optimizing  linear  transceivers  U/.’s  and  V/,.’s  requires  specification  of  DoFs  dk  in  advance,  since  the  dimension  of 
U k  and  Vfc  depends  on  dk-  To  avoid  this  explicit  dependence  on  dk,  we  consider  optimizing  the  transmit  covariance  matrix 
instead  of  linear  transceivers  Ufc  and  Vfc.  In  particular,  we  write  the  utility  function  of  user  /,:  as 


Uk  =  Tr 


k 

HfcfcQfcHfcfc(er2I  +  'Y  Hfc^QfHfc^)-1 


e= 1 


(14) 


where  Qfc  =  ^  vfc(vfe)T 's  the  transmit  covariance  matrix  of  the  fc-th  user.  However,  this  utility  function  still  does  not  related 
to  the  sum-rate  directly.  In  the  sequel,  we  propose  a  weighting  approach  to  relate  the  utility  function  in  (14)  to  the  rate  of 
user  k. 

Consider  the  well-known  weighted  sum-rate  maximization  problem 


K 

max  Y  akRk 
{CWf=i 

s.t.  Tr(Qfc)  <  pfc,  Qfc  0 


(15) 
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where  Rk  =  logdet  ^1  +  HfcfcQfcH^fe(cr2I  +  Y^ejtkH-keQeH-ke)  1j  is  the  achievable  rate  of  user  k  and  the  coefficient  ak 
denotes  user  k’s  weight.  Using  linear  algebra  to  simplify  the  objective  function,  the  above  problem  can  be  reformulated  as  the 
following  equivalent  optimization  problem: 


min 

1Q*}£U 


K 

E 

k= 1 


K 


—  1\ 


ak  log  det  I  -  UkkQkUlk  er2I  +  Y  H^QfH 


kt 


s.t.  Tr(Q k)<pkl  Qfc  ^  0, 


(16) 


where  the  term  inside  the  determinant  is  linearly  related  to  the  utility  function  in  (14).  Similar  to  [51],  we  reformulate  the 
problem  (16)  by  further  introducing  new  optimization  variables  W/;:  £  R M x  M ,  k  =  1,  2, . . . ,  K,  to  obtain  the  following 
equivalent  optimization  problem 

K  K 

min  VafcTr(Wfc3fe(Q))  -  V  logdet  Wfc  (17) 

{Qfc,Wfe}f=1  “  ti 

s.t.  Tr(Qfc)  <  pk  Qfe  R  0, 


where  Q  =  (Qi,  Q2,  • .  - ,  Q/v)  and 

K 

flfc(Q)  =  I  -  HfcfcQfcH^(n2I  +  Y  HfefQfH^)”1. 

e= l 

The  optimization  problem  (17)  is  convex  in  { W/,.  }k=i .  By  checking  the  first  order  optimality  condition,  the  optimal  W k  is 
given  by 

Wf  =  I  +  HttQtH^((T2I  +  ^HuQ^H^)-1,  V  k  =  1, 2, . . . ,  K.  (18) 

tjik 


By  plugging  back  the  optimal  in  (17),  we  immediately  see  the  equivalence  of  (17)  and  (16).  Furthermore,  in  order  to 

have  a  distributed  approach,  we  let  users  update  their  transmit  covariance  matrix  independently.  Therefore,  for  fixed  {WfcjjL-p 
user  k  can  solve  the  following  optimization  problem  to  update  its  transmit  covariance  matrix: 


max 

Qt 


afcTr 


K 

WkHkkQkUlk(a2I  +  Y  HfcrQ^H^)-1 


1  =  1 


+  E  aJTr 


K 


Hfda2! 


T\- 1 


1=1 


s.t.  Tr(Qfe)  <  pk  Qk  R  0. 


(19) 


Unfortunately,  this  objective  function  is  not  convex.  In  order  to  make  the  problem  convex,  we  keep  the  first  term  in  the  objective 
function  (which  is  a  concave  function  of  Q k )  and  use  the  local  linear  approximation  of  the  second  term,  i.e., 


E  ai  Tr  WTT,,QJT;,(Cm.  +  HjkQkHjk) 

j/fc 


T  \-ll 


T  \-l 


E  f*/iv  {w.JTa.Q.-H;.,  \(Cjk  +  UjkQkU jk) 

~{Cjk  +  HjfcQfcH^.)  1HjkQkHjk(Cjk  +  HjfcQfcH2fc) 

+  (Cjfc  +  HjfcQfeH^.)  1HjfeQfcHjfc(Cj/c  +  1  | 


T  \-l 


where  Qk  is  the  local  value  of  transmit  covariance  matrix  at  the  previous  iteration  and  Cjk  is  the  received  signal  covariance 
matrix  at  receiver  j  excluding  the  fc-th  user’s  signal,  i.e., 


Cjk  —  cr2I  +  'Y 
e^k 


(20) 


By  substituting  the  above  approximation  in  (19)  and  simplifying  the  resulting  optimization  problem,  we  get 


max 

Qfc 

s.t. 


K 


ctfcTr  W fcHfcfcQfcHjyj. (<r2I  +  Y  HwQ/H^)"1 

1  =  1 

Tr(Qfc)  <  pk  Qk  R  0 


-  Tr  [BfeQfc] 


(21) 
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where  Bfe  =  J2j^h  Hji  (C  ,.  1  l;f,Q,  1 1 'n  W  II  QiH'.iC;;  H',  )  11  A.  The  objective  function  in  (21)  considers 

the  effect  of  transmit  covariance  matrix  of  user  k  on  not  only  its  own  rate,  but  also  those  of  others  in  the  interference  channel. 
Similar  balanced  approaches  have  been  considered  in  related  works,  see  [42],  [52-54].  By  further  simplification  of  the  objective 
function  and  using  the  Schur  complement,  the  problem  can  be  formulated  as  the  following  Semi-definite  Programming  (SDP) 
form: 


min  afcTr  [Y]  +  Tr  [BfcQfc]  (22) 

Qfc,Y 

s.t.  Tr(Q  k)<Pk,  Qk>0, 

‘  Ckk  +  UkkQkUlk  (WfcCfcfc)1/2  1 

(WfeCfcfe)V2  Y  J  U‘ 


Note  that  the  matrices  Wj  and  Ckk  are  updated  by  (18)  and  (20)  respectively.  Thus  ~WkCkk  is  Hermitian  positive  semi- 
definite.  Hence,  for  fixed  matrices  {WfcjjL-p  user  k  can  update  its  transmit  covariance  matrix  Q/,  by  solving  the  above  SDP 
problem. 


1.  initialize  with  Q*.  =  and  Wk  =  I,  for  all  k  =  1,  2,  ■  ■  ■  ,  K 

2.  repeat 

3.  for  k  =  1, 2,  ■  •  ■  ,  K  do 

4.  update  W k  according  to  (18) 

5.  update  Qk  by  solving  (22) 

6.  update  W;,.  according  to  (18) 

7.  until  convergence,  or  |jQ  —  Q||  <  e 


2:  An  iterative  SDP  approximation  algorithm  for  sum-rate  maximization 

Note  that  the  second  term  in  (19)  is  a  convex  function  of  Q/,.  Therefore,  the  local  linear  approximation  is  a  lower  bound 
which  is  tight  at  the  current  point  Q/, .  Hence,  by  solving  (22),  we  minimize  a  concave  lower  bound  of  the  original  utility 
function  (19).  Since  the  previous  iterate  Q/,  is  feasible  for  (19),  it  follows  that  the  system  utility  function  (i.e.,  the  objective 
function  of  (19))  is  non-decreasing.  Furthermore,  (19)  is  bounded  from  above  and  this  implies  the  sequence  of  objective  function 
values  generated  by  the  proposed  algorithm  converges.  The  following  theorem  further  establishes  the  iterate  convergence  to  a 
stationary  point  for  the  proposed  algorithm.  In  order  to  prove  that  each  limit  point  of  this  algorithm  is  a  stationary  point  of 
the  original  problem  we  need  the  following  lemma. 


Lemma  1  If  the  direct  channel  matrices  are  full-rank  and  tall,  the  function: 

f  K 


/2fc-i(Qfc)  =  Ofe  Tr 


WfcHfcfcQfcH^(a2I  +  ^  HfcfQfH^)-1 


i=i 

is  strictly  concave  with  respect  to  symmetric  positive  semidefinite  matrix  Qk.  Moreover, 
strictly  convex  with  respect  to  ~W  k. 


—  Tr  [BfcQfc]  (23) 

the  objective  function  of  (17)  is  also 


Proof:  See  Appendix,  Section  VI. 


Theorem  3  Assuming  that  the  direct  channel  matrices,  Hfc/j,  are  full  rank  and  tall,  then  every  limit  point  of  the  proposed 
algorithm  is  a  stationary  point  of  (15). 

Proof:  According  to  Lemma  2  (see  Appendix,  section  VI),  every  stationary  point  of  (17)  is  also  a  stationary  point  of 
(15).  Therefore,  we  only  need  to  prove  that  every  limit  point  of  the  proposed  algorithm  is  a  stationary  point  of  (17).  To  this 
end,  let  us  define  the  auxiliary  variable  X*  =  [X]  }|£fj ,  where  Xil2k_1  =  Q  j.  is  the  updated  transmit  covariance  matrix  of 
user  k  at  i-th  iteration  and  XjA:  =  Wj.:  is  the  updated  weight  matrix  of  user  k  at  i-th  iteration.  In  particular,  we  define  Qj.  to 
be  the  solution  of  the  following  problem 

Xk-1  =  Ql  =  Argmax  f2k- i(Qfc;  X‘, X* , . . . , X*fe_2, X^, X^\ . . . , X^1) 

Qk 

s.t.  Tr(Qfc)  <  pk ,  Qfc  ^  0 
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where  f2k-i{Qk',  X^,  X|, . . . ,  X2fe_2,  , . . . ,  X^-1)  is  the  objective  function  of  (21)  which  is  the  local  concave 

lower  bound  approximation  of  the  objective  function  in  (17)  as  discussed  in  section  IV.  Similarly,  we  define  XJ,fc  =  Wj.  to 
be  the  updated  weight  matrix  of  user  k  at  iteration  i,  i.e.. 


X^fc  =  Wj,  =  Argmax  f(Wk:X\, 

wfc 


,X 


,X 


2  —  1 


r  i —  1 

i^2k-li^2k+li^2k  ’■ 


^2K) 


where  /(■;  •)  is  the  objective  function  in  (17). 


Let  Xi4(X*1,Xj,...,X*A.)  be  the  tuple  of  transmit  covariance-weight  matrices  and  X*  be  a  limit  point  of  the  sequence 
{X*}?^.  Therefore,  there  exists  a  subsequence  of  indices  {i\,  12, ij, ...}  such  that 

lim  X^'  =  X* 

j— »oo 

First,  we  will  prove  that  lim  X  ,'  —  X  j'  =  0  by  using  contradiction.  Suppose  the  contrary.  Hence,  by  further  restricting  to 

j^oo 

a  subsequence  if  necessary,  we  have 

37*  >  0  such  that  >  7*,  Vj, 

where  7*3  =  ||X^+1  —  Xp'||.  Let  S y'  =  — — Xl  .  Since  ||S^||  =  1,  according  to  Bolzano-Weierstrass  theorem,  there  exists 
a  subset  of  indices,  denoted  by  I,  and  a  unit  length  matrix  S*  such  that 

lim  S;j  =  S*. 

ij  E  J,  j — >-oo 

Obviously,  0  <  e-y*  <  7*’  for  every  e,  0  <  e  <  1.  Moreover,  since  the  feasible  set  is  convex,  X l’  +  67*  Sv'  belongs  to  the 
feasible  set.  Therefore,  according  to  the  definition  of  X^  and  using  the  concavity  of  /j,  we  have 

fi  (Kj+1  ;X**)  >  h  (x;j  +  e7*  si3 ;  X* )  >  A  (X? ;  X4' ) .  (24) 


On  the  other  hand,  the  value  of  the  objective  function  in  (17)  is  always  increasing  and  bounded  from  above.  Moreover,  the 
feasible  set  is  closed  and  therefore  X*  is  in  the  feasible  set.  Hence,  the  value  of  objective  function  converges  to  /(X*),  i.e., 

lim  A (X*/ ;  X®3 )  =  lim  /^X^jX4*)  =  /(X*). 

J— >00  J— >00 

Therefore,  letting  j  — »  00  with  ij  £  I  in  (24)  yields 

fi (X j  +  e7*St;  X*)  =  /(X*),  V  e  G  [0, 1], 

which  contradicts  the  strict  concavity  of  /i(*)  (c.f.  Lemma  1).  Therefore,  lim  X^J+1  —  X^  =  0,  or  equivalently,  we  have 

3~>  °° 


lim  X'J+1  =  lim  X*3  =  XJ. 

j — KX>  j — >-00 


On  the  other  hand,  X^+1  is  the  local  maximum  of  ,/j  (■.  X'' ).  Hence, 


Tr 


VXl/i(Xlj+1;X®0  (Xi-X1/) 


<0, 


for  any  feasible  point  X  | .  Letting  j  — >  oo  and  using  (25)  yield 


Tr 


VXl/i(Xj;  X*)  (Xx  —  Xj) 


<  0. 


Since  /(•)  and  /i(-,X*)  have  the  same  gradient  with  respect  to  X*  at  point  X*,  it  follows  that 


Tr 


VXl/(X*)T(X!-Xj) 


<  0. 


Repeating  the  same  argument  for  all  k  =  1, 2, . . . ,  2 K,  we  get 


Tr 


VXfc/(X*)T(Xfc  —  X£) 


<0,  Vfc  =  l,2,  ...,2A'. 


By  summing  up  all  the  equations  for  all  k’s  we  get. 


Tr 


Vx/(X*)T(X  —  X*) 


<  0 


(25) 


which  implies  the  stationarity  of  X* .  ■ 

A  couple  of  remarks  are  in  order.  First,  in  the  proof  of  Theorem  3  we  have  only  used  the  strict  concavity  of  function  /(•). 
Consequently,  the  proof  works  for  other  objective  functions  that  have  the  same  property  and  using  similar  methods,  e.g.  [42]. 
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Second,  after  solving  (22)  to  get  the  solution  Q£,  we  can  update  the  transmit  covariance  matrix  by  using  relaxation  parameter 
0  <  a  <  1,  i.e.,  Qfc  < —  oQ)  +  (1  —  q)Q k-  It  can  be  shown  that  the  convergence  result  of  Theorem  3  holds  even  by  using  a 
fixed  relaxation  parameter. 

An  alternative  to  solving  (22)  at  each  iteration  is  to  update  the  transmit  covariance  matrix  in  a  totally  unselfish  manner,  i.e., 
solving  the  following  problem 

min  Tr[BfcQfc] 

Qfc 

s.t.  Tr(Qfc)  =  pk,  Qk  >  0.  (26) 

The  above  problem  has  a  closed  form  solution  Q/,  =  p/i:qq7  ,  where  q  is  the  eigen  vector  of  B/,  corresponding  to  its  minimum 
eigen  value.  This  unselfish  approach  requires  all  the  users  to  exhaust  all  their  transmit  power,  potentially  causing  unnecessary 
interference.  Furthermore,  it  results  in  one  DoF  for  each  user  because  Q/,.  is  always  rank  one.  In  cases  that  the  all  one  DoF 
vector  is  not  appropriate  either  because  it  is  not  achievable  or  because  it  is  too  conservative,  the  above  unselfish  strategy  cannot 
lead  to  the  maximization  of  sum  DoFs. 

In  general,  if  we  know  the  DoF  of  each  user  a-priori  and  allocate  equal  power  across  the  data  streams,  we  can  update  the 
transmit  beamformer  of  user  k  by  solving  the  following  optimization  problem: 

min  Tr  [V^BfcVfc]  (27) 

Vfc 

s.t.  vJTVfc  =  ^  i. 

dk 

This  approach  lets  each  transmitter  use  maximum  power  and  pick  a  transmit  covariance  matrix  V k  so  as  to  minimize  the 
total  interference  to  other  users.  It  has  a  closed  form  solution  V k  whose  columns  are  proportional  to  the  eigenvectors  of  B;,. 
corresponding  to  its  dk  smallest  eigen  values,  scaled  appropriately  to  satisfy  the  power  budget  constraint. 


1.  initialize  with  Vk  =  0  and  W k  =  1,  for  all  k  =  1, 2,  •  • 

•  ,K 

2.  repeat 

3.  for  k  =  1, 2,  ■  •  •  ,  K  do 

4.  update  V/.  by  solving  (27) 

5.  update  W k  according  to  (18) 

6.  until  convergence 

3:  The  unselfish  algorithm  for  sum  DoF  maximization 


V.  Simulation  Results 

In  this  section,  we  present  some  numerical  results  comparing  the  Decentralized  Interference  Alignment  (DIA)  method  [2] 
with  our  proposed  methods.  All  numerical  results  are  averaged  over  20  channel  realizations.  In  each  channel  realization,  the 
path  loss  of  the  channel  coefficients  are  generated  by  a  relay-backhaul  model  provided  by  Huawei  Technologies.  We  consider 
19-hexagonal  wrap-around  cell  layout.  We  randomly  choose  K  base  stations,  each  serving  a  random  relay  in  its  own  cell  at 
each  time  slot.  Each  base  station  serves  different  relays  in  its  own  cell  orthogonally.  Therefore,  at  each  time  slot,  the  base 
station-relays  form  an  interference  channel.  The  relays  have  fixed  locations  so  the  the  system  has  enough  time  to  leant  the 
channels.  The  MIMO  channel  coefficients  are  modeled  by  the  standard  single  tap  Rayleigh  fading  model.  We  consider  linear 
MMSE  receivers  and  equal  power  budget  for  all  users  and  for  all  methods.  To  implement  DIA,  we  need  to  predetermine  DoF 
for  all  users.  In  all  simulations  DoFs  are  set  to  be  equal  for  all  users. 

In  the  first  numerical  experiment,  we  consider  K  =  10  base  station-relay  pairs,  each  equipped  with  M  =  2  antennas.  The 
predetermined  degrees  of  freedom  used  in  DIA  method  are  d\  =  di  =  . . .  =  d  =  d  =  1.  Figure  4  represents  the  sum-rate 
comparison  between  the  proposed  methods  and  DIA.  As  Figure  4  shows,  the  proposed  method  yields  substantially  higher 
sum-rates  in  this  case.  In  fact,  the  sum-rate  achieved  by  the  DIA  method  does  not  grow  linearly  with  SNR,  indicating  that 
interference  alignment  has  not  been  achieved. 

It  is  known  that  the  DIA  method  works  well  for  the  K  =  3  case  where  interference  alignment  is  possible  [2].  We  consider 
the  case  of  K  =  3  transceiver  pairs  each  equipped  with  M  =  3  antennas  and  one  DoF  is  considered  for  each  transmitter.  As  can 
be  seen  in  Fig.  5,  the  selfish  and  the  SDP  approach  works  well  in  low  SNR,  but  is  outperformed  by  the  DIA  approach  in  high 
SNR  region  where  the  interference  alignment  effect  begins  to  kick  in.  Interestingly,  our  Unselfish  approach  for  interference 
alignment  outperforms  the  DIA  algorithm  in  the  entire  practical  SNR  range.  Although  the  DIA  method  and  the  Unselfish 
approach  both  achieve  a  sum-rate  that  increases  linearly  with  SNR,  the  Unselfish  approach  has  a  better  offset  compared  to  the 
DIA  method. 
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4:  Sum-rate  vs.  SNR:  K  =  10,  M  =  2,  d  =  1 


5:  Sum-rate  vs.  SNR:  K  =  4,  M  =  3,d  =  1 


VI.  Appendix:  Proof  of  Lemmas  1  and  2 


Lemma  1  If  the  direct  channel  matrices  are  full-rank  and  tall,  the  function: 

K 


/2fc-i(Qfc)  =  cifcTr 


WfeH,.feQfcH^(a2I  +  Y,  H«QfH^)- 


-  Tr  [B/,Q/, 


(28) 


i=i 

is  strictly  concave  with  respect  to  symmetric  positive  semidefinite  matrix  Qfe.  Moreover,  the  objective  function  of  (17)  is  also 
strictly  convex  with  respect  to  Wj.. 


Proof:  Using  the  notations  we  have  defined  so  far,  /2fc-i(Qfc)  is  given  by 

/2fc-i(Qfc)  =  otk  [Tr  (WfcCfefe(Cfcfe  +  HfefeQfeH^.)-1)  —  Tr(BfcQfe)]  . 

The  second  term  in  f2k-i{Qk )  is  linear  in  Q/;  and  does  not  change  the  strict  concavity  of  the  function.  Hence,  it  suffices  to 
show  the  strict  concavity  of  Tr  ( WkCkk{Ckk  +  HfcfcQ^H^.)-1).  To  do  so,  it  is  enough  to  prove  that  the  function  is  strictly 
concave  in  any  feasible  direction.  We  drop  the  index  k  for  notational  simplicity.  Let  us  consider  a  feasible  direction  denoted 
by  a  symmetric  matrix  D  ^  0  of  appropriate  size  and  a  scalar  t  >  0.  We  further  define  the  notation  D  =  HDHT  and  the 
function 


=  Tr(WC(C  +  X  +  tr>')) 
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where  X  =  HQHT  and  C  +  X  +  /  D  are  positive  definite  matrices.  Since  the  direct  channel  matrix  H  is  tall  and  full-rank,  it 
follows  that  D  7^  0.  Moreover,  by  the  definitions  of  W  and  C  (18)-(20),  we  know  the  matrix  WC  is  symmetric  and  positive 
definite.  It  suffices  to  show  the  strict  concavity  of  hi  with  respect  to  t  for  each  symmetric  D. 

To  prove  the  strict  concavity  of  hi,  we  will  calculate  the  second  order  derivative  of  hi  with  respect  to  t  and  prove  that  it 
is  negative.  If  we  denote  B  =  (C  +  X  +  /D  )_1,  then  the  first  order  derivative  is  given  by 


dhi 

dt 


In  addition,  we  know  that 


5Tr(wC(C  +  X  +  fD')-1) 
dt 


Tr  ((VBTr(WCB))T  . 


<9B 

~dt 


BD  B, 


VBTr(WCB)  =  CTWr, 


which  further  implies 

=  -Tr  (WCBDB)  .  (29) 

•  q2  i 

In  a  similar  way,  we  can  calculate  the  second  order  derivative 

^ -  =  —  2Tr  (d'BWCBD'b)  =  -2Tr  ((DBD')(BWCB))  .  (30) 

As  C  and  B  are  positive  definite  we  can  conclude  that  BWCB  is  also  positive  definite  and  D  BD  is  positive  semi-definite. 
Since  D  BD  f  0,  it  must  have  at  least  one  non-zero  eigenvalue  A  >  0  with  a  corresponding  eigenvector  v.  Then, 

=  -TV  ((D'BD  )(BWCB)J 

<  -ATT  (vtv(BWCB)) 

=  -ATV  (vt(BWCB)v) 

=  -ATV  (vT(B(WC)5(WC)5B)vj 
=  -A||(WC)5Bv||2  <  0. 


Next  we  prove  that  the  objective  function  in  (17)  is  strictly  convex  in  W;..  The  first  summation  in  (17)  is  linear  in  W /,’s  and 
does  not  change  the  strict  convexity.  Moreover,  the  objective  function  in  (17)  is  decomposable  over  W\..  Hence,  to  accomplish 
the  proof  of  lemma  1  we  just  need  to  prove  the  strict  convexity  of  —  log  det(W/,.)  in  W /,. .  For  notational  simplicity,  we  drop 
the  index  k ,  and  prove  the  strict  convexity  of  —  logdet(W)  along  any  feasible  direction  within  the  set  of  positive-definite 
matrices.  Let  G  be  a  feasible  direction  and  t  be  a  positive  scalar  such  that  W  +  f G  >  0.  Then  we  define  a  one-dimensional 
parametrization  of  —  logdet(W)  along  the  direction  G 

h2(t)  =  -logdet(W  +  £G). 


Using  properties  of  the  determinant  function  and  the  fact  that  W  is  positive-definite,  we  have 

h2(t)  =  -logdet(W1/2(I  +  £W~1/2GW~1/2)W1/2) 

=  -logdet(W)  -logdetjl  +  fW-^GW-1/2) 

=  —  logdet(W)  -  y^log(l  +  tXj), 

i 

where  Ads  are  the  eigenvalues  of  W_1/,2GW-1/2  and  the  last  step  of  the  above  procedure  is  due  to  the  fact  that  eigenvalues 
of  I  +  X  are  one  plus  the  eigenvalues  of  X.  Obviously  for  any  value  of  A,  the  function  —  log(l  +  tX,  )  is  convex  with  respect 
to  t  and  for  any  non-zero  A,  is  strictly  convex  in  t.  Since  G  is  non-zero  and  W  is  positive-definite,  it  follows  that  there  exists 
at  least  one  non-zero  Ai  which  means  that  —  JTlog(l  +  £Aj)  is  strictly  convex.  Thus,  h2(t)  is  strictly  convex  in  t.  ■ 


Lemma  2  Let  {(Q£,  be  a  stationary  point  of  (17),  then  the  point  is  a  stationary  point  of  (15).  Conversely, 

if  {Qfc}f=i  is  a  stationary  point  of  (15),  then  {(Q£,  is  a  stationary  point  of  (16),  where  W£  =  Q*)  1, 

ft  =  1,  2, ,  K. 
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Proof:  Let  us  use  if) i(Q,W)  and  La  ('  Q )  to  denote  the  objective  functions  of  (17)  and  (16)  respectively,  i.e., 

K  K 

Vh(Q,  W)  =  J2ak  (Wk9k(Q))  -  X! logdet(Wfc), 

k= 1  fc=l 

K 

1^2  (Q)  =  ^]logdet(5fc(Q)). 

fc  =  l 

Suppose  {Qfc,W^}^=1  is  a  stationary  point  of  (17).  Since  the  constraints  in  (17)  are  separable  in  the  variables,  we  have 

Tr(VQfc^(Q%W*)T(Q,-Q*))  >  0,  k=l,2,...,K,  (31) 

Tr  (VWfcV,i(Q*)  W*)T(Wfc  —  W^))  >  0,  k  =  l,2,...,K,  (32) 

for  any  feasible  point  {Qfc,  By  taking  the  gradient  of  •)  with  respect  to  W /.  and  further  simplifying  (32),  we 

get 

Tr  [(gk( Q*)  -  (W*)-1)(Wfe  -  W*)]  >  0. 

Since  this  inequality  holds  for  any  W*,  it  follows  that 

W*k=gk(Q*)-\  V  k  =  1,2, . . .  ,K.  (33) 


Fix  any  index  l  and  let  us  use  qm,n  to  denote  the  (m,n)-th  entry  in  Q<  .  Differentiating  using  chain  rule,  we  obtain 


dfi 


(Q*,W») 


T  (wT*  ^9k(Q)  \ 


k= 1 

K 


rr  (  m*\-\  ®9k{ Q)\ 

/  . afcTr  (  3fc(Q  )  a  ) 


fc= 1 

=  dip2 

^Qrn.n 

where  the  second  equality  follows  from  (33).  This  further  implies  that 


r)n  /  (Q*,w*) 

U(im,n  / 


Tr(VQtt/>2(Q*)T(Qfc-QD)  =Tr(VQ>1(Q*,W*)T(Qfc-Q^))  >0  V  k  =  1,2, . . .  ,K,  (34) 

which  guarantees  the  stationarity  of  the  point  { Qj.  }/Kl  |  for  (16).  Furthermore,  since  the  objective  function  in  (16)  and  the 
objective  function  in  (15)  only  differ  in  sign,  the  stationarity  of  >n  (16)  is  equivalent  to  the  stationarity  of  { Q(: 

for  (15).  To  prove  the  converse,  we  can  define  W£  =  gk( Q*)-1,  A:  =  1,2.....  K ,  and  simply  reverse  the  above  argument  to 
show  that  (31)  and  (32)  hold.  This  further  implies  the  stationarity  of  the  point  {(Q^,  for  (17).  ■ 
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